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The invention relates to expression systems and methods for 
expression of desired genes and gene products in cells. 
Particularly, the invention relates to a gene encoding a RNA 
10 binding protein useful for regulating gene expression in cells, 
the protein binding site, a gene encoding a regulating protein 
disulfide isomerase and methods and systems for gene expression 
of recombinant molecules. 



Expression systems for expression of exogenous foreign 
genes in eukaryotic and prokaryotic cells are basic components 
of recombinant DNA technology. Despite the abundance of 
expression systems and their wide-spread use, they all have 



20 characteristic disadvantages. For example, while expression in 
E. coli is probably the most popular as it is easy to grow and 
is well understood, eukaryotic proteins expressed therein are 
not properly modified. Moreover, those proteins tend to 
precipitate into insoluble aggregates and are difficult to 

25 obtain in large amounts. Mammalian expression systems, while 

practical on small-scale protein production, are more difficult, 
time-consuming and expensive than in E . coli. 

A number of plant expression systems exist as well as 
summarized in US Patent No. 5,234,834, the disclosures of which 

30 are hereby incorporated by reference. One advantage of plants 
or algae in an expression system is that they can be used to 
produce pharmacologically important proteins and enzymes on a 
large scale and in relatively pure form. In addition, 
micro-algae have several unique characteristics that make them 

35 ideal organisms for the production of proteins on a large scale. 
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First, unlike most systems presently used to produce transgenic 
proteins, algae can be grown in minimal media (inorganic salts) 
using sunlight as the energy source. These algae can be grown 
in contained fermentation vessels or on large scale in monitored 
ponds. Ponds of up to several acres are routinely used for the 
production of micro-algae. Second, plants and algae have two 
distinct compartments, the cytoplasm and the chloroplast, in 
which proteins can be expressed. The cytoplasm of algae is 
similar to that of other eukaryotic organisms used for protein 
expression, like yeast and insect cell cultures. The 
chloroplast is unique to plants and algae and proteins expressed 
in this environment are likely to have properties different from 
those of cytoplasmically expressed proteins. 

The present invention describes an expression system in 
which exogenous molecules are readily expressed in either 
prokaryotic or eukaryotic hosts and in either the cytoplasm or 
chloroplast. These beneficial attributes are based on the 
discovery and cloning of components of translation regulation in 
plants as described in the present invention. 

Protein translation plays a key role in the regulation of 
g ene expression across the spectrum of organisms (Kozak, Ann , 
Rev . Cell Biol . . 8:197-225 (1992) and de Smit and Van Duin, 
Proa. Nucleic Acid Res. Mol . Biol. , 38:1-35 (1990)). The 
majority of regulatory schemes characterized to date involve 
translational repression often involving proteins binding to 
mRNA to limit ribosome association (Winter et al . , Proc . N atl . 
Acad. Sci . . USA , 84:7822-7826 (1987) and Tang and Draper, 
Biochem. , 29:4434-4439 (1990)). Translational activation has 
also been observed (Wulczyn and Kahmann, Cell . 65:259-269 
(1991) ) , but few of the underlying molecular mechanisms for this 
type of regulation have been identified. In plants, light 
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activates the expression of many genes. Light has been shown to 
activate expression of specific chloroplast encoded mRNAs by 
increasing translation initiation (Mayfield et al . , Ann . Rev . 
Plant Physiol. Plant Mol . Biol. , 46:147-166 (1995) and Yohn et 
al . , Mol . Cell Biol . . 16:3560-3566 (1996)). Genetic evidence in 
higher plants and algae has shown that nuclear encoded factors 
are required for translat ional activation of specific 
chloroplast encoded mRNAs (Rochaix et al . , Embo J . . 8:1013-1021 
(1989), Kuchka et al . , Cell . 58:869-876 (1989), Girard-Bascou et 
al., Embo J . . 13:3170-3181 (1994), Kim et al . , Plant Mol . Biol . . 
127 : 1537-1545 (1994) . 

In the green algae Chlamydomonas reinhardtii, a number of 
nuclear mutants have been identified that affect translation of 
single specific mRNAs in the chloroplast, often acting at 
translation initiation (Yohn et al . , supra . (1996)). Mutational 
analysis of chloroplast mRNAs has identified sequence elements 
within the 5* untranslated region (UTR) of mRNAs that are 
required for translat ional activation (Mayfield et al . , supra , 
(1995), Mayfield et al . , J. Cell Biol . . 127:1537-1545 (1994) and 
Rochaix, Ann. Rev, Cell Biol . . 8:1-28 (1992)), and the 5' UTR of 
a chloroplast mRNA can confer a specific translation phenotype 
on a reporter gene in vivo (Zerges and Rochaix, Mol . Ce ll Biol . , 
14:5268-5277 (1994) and Staub and Maliga, Embo J. . 12:601-606 
(1993) . 

Putative translational activator proteins were identified 
by purifying a complex of four proteins that binds with high 
affinity and specificity to the 5' UTR of the chloroplast 
encoded psJbA mRNA [encoding the Dl protein, a major component of 
Photosystem II (PS II)] (Danon and Mayfield, Embo J . , 10:3993- 
4001 (1991)). Binding of these proteins to the 5' UTR of psJbA 
mRNA correlates with translation of this mRNA under a variety of 
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physiological (Danon and Mayfield, id. . (1991)) and biochemical 
conditions (Danon and Mayfield, Science, 266:1717-1719 (1994) 
and Danon and Mayfield, Embo J . , 13:2227-2235 (1994)), and in 
different genetic backgrounds (Yohn et al . , supra . (1996)) . The 
5 binding of this complex to the psbA mRNA can be regulated in 
vitro in response to both redox potential (Danon and Mayfield, 
Science , 266:1717-1719 (1994)) and phosphorylation (Danon and 
Mayfield, Embo J . , 13:2227-2235 (1994)), both of which are 
thought to transduce the light signal to activate translation of 
10 psbA mRNA. The 47 kDa member of the psbA RNA binding complex 
(RB47) is in close contact with the RNA, and antisera specific 
to this protein inhibits binding to the psbA mRNA in vitro 
(Danon and Mayfield, supra , (1991) ) . 



15 been reported, the protein has not been extensively 

characterized and the gene encoding RB4 7 has not been 
identified, cloned and sequenced. In addition, the regulatory 
control of the activation of RNA binding activity to the binding 
site by nuclear-encoded trans-acting factors, such as RB60, have 

20 not been fully understood. The present invention now describes 
the cloning and sequencing of both RB47 and RB60. Based on the 
translation regulation mechanisms of RB47 and RB60 with the RB47 
binding site, the present invention also describes a translation 
regulated expression system for use in both prokaryotes and 

25 eukaryotes. 

Brief Description of the Inve ntion 

The RB47 gene encoding the RB4 7 activator protein has now 
been cloned and sequenced, and the target binding site for RB47 
3 0 on messenger RNA (mRNA) has now been identified. In addition, a 
regulatory protein disulfide isomerase, a 60 kilodalton protein 



Although the translat ional control of psbA mRNA by RB4 7 has 
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referred to as RB60, has also been cloned, sequenced and 
characterized. Thus, the present invention is directed to gene 
expression systems in eukaryotic and prokaryotic cells based on 
translational regulation by RB47 protein, its binding site and 
the RB60 regulation of RB47 binding site activation. 

More particularly, the present invention describes the use 
of the RB47 binding site, i.e., a 5' untranslated region (UTR) 
of the chloroplast psbA gene, in the context of an expression 
system for regulating the expression of genes encoding a desired 
recombinant molecule. Protein translation is effected by the 
combination of the RB47 binding site and the RB47 binding 
protein in the presence of protein translation components. 
Regulation can be further imposed with the use of the RB60 
regulatory protein disulfide isomerase. Therefore, the present 
invention describes reagents and expression cassettes for 
controlling gene expression by affecting translation of a coding 
nucleic acid sequence in a cell expression system. 

Thus, in one embodiment, the invention contemplates a RB4 7 
binding site sequence, i.e., a mRNA sequence, typically a mRNA 
leader sequence, which contains the RB47 binding site. A 
preferred RB4 7 binding site is psbA mRNA. For use in expressing 
recombinant molecules, the RB47 binding site is typically 
inserted 5' to the coding region of the preselected molecule to 
be expressed. In a preferred embodiment, the RB47 binding site 
is inserted into the 5 1 untranslated region along with an 
upstream psbA promoter to drive the expression of a preselected 
nucleic acid encoding a desired molecule. In alternative 
embodiments, the RB47 binding site is inserted into the 
regulatory region downstream of any suitable promoter present in 
a eukaryotic or prokaryotic expression vector. Preferably, the 
RB47 binding site is positioned within 100 nucleotides of the 
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translation initiation site. In a further aspect, 3' to the 
coding region is a 3 ' untranslated region (3' UTR) necessary for 
transcription termination and RNA processing. 

Thus, in a preferred embodiment, the invention contemplates 
5 an expression cassette or vector that contains a transcription 
unit constructed for expression of a preselected nucleic acid or 
gene such that upon transcription, the resulting mRNA contains 
the RB47 binding site for regulation of the translation of the 
preselected gene transcript through the binding of the 

10 activating RB47 protein. The RB47 protein is provided 

endogenously in a recipient cell and/or is a recombinant protein 
expressed in that cell . 

Thus, the invention also contemplates a nucleic acid 
molecule containing the sequence of the RB47 gene. The nucleic 

15 acid molecule is preferably in an expression vector capable of 

expressing the gene in a cell for use in interacting with a RB47 
binding site. The invention therefore contemplates an expressed 
recombinant RB4 7 protein. In one embodiment, the RB4 7 binding 
site and RB47 encoding nucleotide sequences are provided on the 

20 same genetic element. In alternative embodiments, the RB47 

binding site and RB47 encoding nucleotide sequences are provided 
separately . 

The invention further contemplates a nucleic acid molecule 
containing the sequence encoding the 69 kilodalton precursor to 

2 5 RB47. In alternative embodiments, the RB4 7 nucleic acid 

sequence contains a sequence of nucleotides to encode a 
histidine tag. Thus, the invention relates to the use of 
recombinant RB47, precursor RB47, and histidine-modif ied RB47 
for use in enhancing translation of a desired nucleic acid. 

3 0 The invention further contemplates a nucleic acid molecule 

containing a nucleotide sequence of a polypeptide which 
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regulates the binding of RB47 to RB47 binding site. A preferred 
regulatory molecule is the protein disulfide isomerase RB60. 
The RB60-encoding nucleic acid molecule is preferably in an 
expression vector capable of expressing the gene in a cell for 
5 use in regulating the interaction of RB47 with a RB47 binding 
site. Thus, the invention also contemplates an expressed 
recombinant RB6 0 protein. In one embodiment, the RB4 7 binding 
site, RB47 encoding and RB60 encoding nucleotide sequences are 
provided on the same genetic element. In alternative 

10 embodiments, the expression control nucleotide sequences are 
provided separately. In a further aspect, the RB60 gene and 
RB47 binding site sequence are provided on the same construct. 

The invention can therefore be a cell culture system, an in 
vitro expression system or a whole tissue, preferably a plant, 

15 in which the transcription unit is present that contains the 

RB47 binding site and further includes a (1) transcription unit 
capable of expressing RB47 protein or (2) the endogenous RB47 
protein itself for the purpose of enhancing translation of the 
preselected gene having an RB47 binding site in the mRNA. 

20 Preferred cell culture systems are eukaryotic and prokaryotic 
cells. Particularly preferred cell culture systems include 
plants and more preferably algae. 



transcription unit capable of expressing a regulatory molecule, 
25 preferably RB6 0 protein, or (2) the endogenous RB60 protein 
itself for the purpose of regulating translation of the 
preselected gene having an RB47 binding site in the mRNA. In an 
alternative preferred embodiment, one transcription unit is 
capable of expressing both the RB47 and RB60 proteins. In a 
3 0 further aspect, the RB47 binding site sequence and RB60 sequence 
are provided on the same construct . 



A further preferred embodiment includes (1) a separate 
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In one aspect of the present invention, plant cells 
endogenously containing RB47 and RB60 proteins are used for the 
expression of recombinant molecules, such as proteins or 
polypeptides, through activation of the RB47 binding in an 
5 exogenously supplied expression cassette. Alternatively, stable 
plant cell lines containing endogenous RB47 and RB60 are first 
generated in which RB4 7 and/or RB60 proteins are overexpressed . 
Overexpression is obtained preferably through the stable 
transformation of the plant cell with one or more expression 
10 cassettes for encoding recombinant RB47 and RB60. In a further 
embodiment, stable cell lines, such as mammalian or bacterial 
cell lines, lacking endogenous RB47 and/or RB60 proteins are 
created that express exogenous RB47 and/or RB60. 



15 transgenic plant, or a plant in which the genetic elements of 
the invention have been introduced. Based on the property of 
controlled translation provided by the combined use of the RB47 
protein and the RB47 binding site, translation can be regulated 
for any gene product, and the system can be introduced into any 

20 plant species. Similarly, the invention is useful for any 
prokaryotic or eukaryotic cell system. 

Methods for the preparation of expression vectors is well 
known in the recombinant DNA arts, and for expression in plants 
is well known in the transgenic plant arts. These particulars 

25 are not essential to the practice of the invention, and 
therefore will not be considered as limiting. 

The invention allows for high level of protein synthesis in 
plant chloroplasts and in the cytoplasm of both prokaryotic and 
eukaryotic cells. Because the chloroplast is such a productive 

30 plant organ, synthesis in chloroplasts is a preferred site of 
translation by virtue of the large amounts of protein that can 



Plants for use with the present invention can be a 
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be produced. This aspect provides for great advantages in 
agricultural production of mass quantities of a preselected 
protein product. 

The invention further provides for the ability to screen 
for agonists or antagonists of the binding of RB47 to the RB47 
binding site using the expression systems as described herein. 
Antagonists of the binding are useful in the prevention of plant 
propagation . 

Also contemplated by the present invention is a screening 
assay for agonists or antagonists of RB60 in a manner analogous 
to that described above for RB47. Such agonists or antagonists 
would be useful in general to modify expression of RB60 as a way 
to regulate cellular processes in a redox manner. 

Kits containing expression cassettes and expression 
systems, along with packaging materials comprising a label with 
instructions for use, as described in the claimed embodiments 
are also contemplated for use in practicing the methods of this 
invention . 

Other uses will be apparent to one skilled in the art in 
light of the present disclosures. 

ttrlftf Description of Drawings 

In the figures forming a portion of this disclosure: 
Figures 1A-1D show the complete protein amino acid residue 
sequence of RB47 is shown from residues 1-623, together with the 
corresponding nucleic acid sequence encoding the RB47 sequence, 
from base 1 to base 2732. The nucleotide coding region is shown 
from base 197-2065, the precursor form. The mature form is from 
nucleotide position 197-1402. Also shown is the mRNA leader, 
bases 1-196, and poly A tail of the mRNA, bases 2066-2732. Both 
the nucleotide and amino acid sequence are listed in SEQ ID NO 
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be produced. This aspect provides for great advantages in 
agricultural production of mass quantities of a preselected 
protein product. 

The invention further provides for the ability to screen 
for agonists or antagonists of the binding of RB47 to the RB47 
binding site using the expression systems as described herein. 
Antagonists of the binding are useful in the prevention of plant 
propagat ion . 

Also contemplated by the present invention is a screening 
assay for agonists or antagonists of RB60 in a manner analogous 
to that described above for RB47. Such agonists or antagonists 
would be useful in general to modify expression of RB60 as a way 
to regulate cellular processes in a redox manner. 

Kits containing expression cassettes and expression 
systems, along with packaging materials comprising a label with 
instructions for use, as described in the claimed embodiments 
are also contemplated for use in practicing the methods of this 
invention C 

Other uses will be apparent to one skilled in the art in 
light of the present disclosures. 

Brief Dftflnription of Drawings 

In the figures forming a portion of this disclosure: 
Figures 1A-1D show' the complete protein amino acid residue 
sequence of RB47 is shown from residues 1-623, together with the 
corresponding nucleic acid sequence encoding the RB4 7 sequence, 
from base 1 to base 2732. The nucleotide coding region is shown 
from base 197-2065, the precursor form. The mature form is from 
nucleotide position 197-1402. Also shown is the mRNA leader, 
bases 1-196, and poly A tail of the mRNA, bases 2066-2732. Both 
the nucleotide and amino acid sequence are listed in SEQ ID NO 
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5 . 

Figures 2A-2B show the complete protein amino acid residue 
sequence of RB60 is shown from residues 1-488, together with the 
corresponding nucleic acid sequence from base 1 to base 2413, of 
5 which bases 16-1614 encode the RB60 sequence. Both the 

nucleotide and amino acid sequence are listed in SEQ ID NO 10. 

Figures 3A-3C show the complete sequence of the psbA mRNA, 
showing both encoded psbA protein amino acid residue sequence 
(residues 1-352) and the nucleic acid sequence as further 
10 described in Example 3 is illustrated. Both the nucleotide and 

k i amino acid sequence are listed in SEQ ID NO 13 . 

±J 

LJ Figure 4 is a schematic diagram of an expression cassette 

* ■■- 

M containing on one transcription unit from 5* to 3 1 , a promoter 
m region derived from the psibA gene for encoding the Dl protein- 
ic from C. reinhardtii further containing a transcription 
H initiation site (TS) , the RB47 binding site, a region for 
M insertion of a foreign or heterologous coding region, a RB4 7 
*i coding region, a RB60 coding region, and the 3' flanking region 
^ containing transcription termination site (TS) , flanked by an 
20 origin of replication and selection marker. Restriction 

endonuclease sites for facilitating insertion of the independent 
genetic elements are indicated and further described in Example 
4A. 

Figures 5A-5B show the nucleotide and amino acid sequence 
25 of the RB47 molecule containing a histidine tag, the sequences 
of which are also listed in SEQ ID NO 14 . 

Figure 6 is a schematic diagram of an expression cassette 
containing on one transcription unit from 5 1 to 3', a promoter 
region derived from the psbA gene for encoding the Dl protein 
3 0 from C. reinhardtii further containing a transcription 

initiation site (TS) , the RB47 binding site, a region for 
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RB4 7 is also shown in Figures 1A-1D (SEQ ID NO 5) . As 
described in Section 2 above, the predicted protein 
sequence from the cloned cDNA contained both the derived 
peptide sequences of RB47 and is highly homologous to 
5 poly (A) binding proteins (PABP) from a variety of 
eukaryotic organisms. 

2. Cloning of RB6 0 

To clone the cDNA encoding the 60 kDa psJbA mRNA 

10 binding protein (RB6 0) , the psbA- specif ic RNA binding 

* c * proteins were purified from light-grown C. reinhardtii 

^ cells using heparin- agarose chromatography followed by 

M psbA RNA affinity chromatography (RAC) . RAC-purif ied 

\n proteins were separated by two-dimensional 

L: ±5 polyacrylamide gel electrophoresis. The region 

□ corresponding to RB6 0 was isolated from the PVDF 

il membrane. RB60 protein was then digested with trypsin. 

'-'i Unambiguous amino acid sequences were obtained from two 

\3 peptide tryptic fragments ( WFVDGELASDYNGPR (SEQ ID NO 6) 

2 0 and ( QL I LWTTADDLKADAE I MTVFR (SEQ ID NO 7) ) as described 

above for RB47. The calculated molecular weights of the 
two tryptic peptides used for further analysis precisely 
matched with the molecular weights determine by mass 
spectrometry. The DNA sequence corresponding to one 
25 peptide of 22 amino acid residues was amplified by PGR 
using degenerate oligonucleotides, the forward primer 
5 1 CGCGGATCCGAYGCBGAGATYATGAC3 1 (SEQ ID NO 8) and the 
reverse primer 5 1 CGCGAATTCGTCATRATCTCVGCRTC3 1 (SEQ ID NO 
9) , where R can be A or G (the other IUPAC nucleotides 

3 0 have been previously defined above) . The amplified 

se q Uence was then used to screen a X-gtlO cDNA library 
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from C. reinhardtii . Three clones were identified with 
the largest being 2.2 kb. Selection and sequencing was 
performed as described for RB47 cDNA. 

The resulting RB60 cDNA sequence is available via 
5 GenBank (Accession Number AF027727) . The nucleotide and 
encoded amino acid sequence of RB60 is also shown in 
Figures 2A-2B (SEQ ID NO 10) . The protein coding 
sequence of 488 amino acid residues corresponds to 
nucleotide positions 16-1614 of the 2413 base pair 
..." 10 sequence. The predicted amino acid sequence of the 
*| cloned cDNA contained the complete amino acid sequences 

of the two tryptic peptides. The amino acid sequence of 
| = : the encoded protein revealed that it has high sequence 
HI homology to both plant and mammalian protein disulfide 
"l5 isomerase (PDI) , and contains the highly conserved 
H thioredoxin-like domains with -CysGlyHisCys- (-CGHC-) 
M (SEQ ID NO 11) catalytic sites in both the N- terminal 
^ and C- terminal regions and the -LysAspGluLeu- (-KDEL-) 
^ (SEQ ID NO 12) endoplasmic reticulum (ER) retention 
20 signal at the C- terminus found in all PDIs. PDI is a 
mutif unctional protein possessing enzymatic activities 
for the formation, reduction, and isomerization of 
disulfide bonds during protein folding, and is typically 
found in the ER. The first 3 0 amino acid residues of 

2 5 RB6 0 were found to lack sequence homology with the 

N-terminal signal sequence of PDI from plants or 
mammalian cells. However, this region has 
characteristics of chloroplast transit peptides of C. 
reinhardtii , which have similarities with both 

3 0 mitochondrial and higher plant chloroplast presequences . 

A transit peptide sequence should override the function 
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of the -KDEL- ER retention signal and target the protein 
to the chloroplast since the -KDEL- signal acts only to 
retain the transported protein in the ER. 



5 3 . Preparation of P -qhA Promoter Sequence and RB47 
binding S ite Nucleotide Sequence 
The chloroplast psbA gene from the green 
unicellular alga C. reinhardii was cloned and sequenced 
as described by Erickson et al . , Embo J . . 3:2753-2762 
_ 10 (1984), the disclosure of which is hereby incorporated 
by reference. The DNA sequence of the coding regions 



and the 5 1 and 3 1 untranslated (UTR) flanking sequences 
of the C. reinhardii psJbA gene is shown in Figures 3A- 



°3 nucleotide sequence is also listed as SEQ ID NO 13. The 
■i deduced amino acid sequence (also listed in SEQ ID NO 
! = 13) of the coding region is shown below each codon 
=2 beginning with the first methionine in the open reading 
20 frame. Indicated in the 5 f non-coding sequence are a 

putative Shine -Dalgarno sequence in the dotted box, two 
putative transcription initiation sites determined by SI 
mapping (SI) and the Pribnow-10 sequence in the closed 
box. Inverted repeats of eight or more base pairs are 
25 marked with arrows and labeled A-D. A direct repeat of 
31 base pairs with only two mismatches is marked with 
arrows labeled 31. Indicated in the 3' non-coding 
sequence is a large inverted repeat marked by a forward 
arrow and the SI cleavage site marking the 3 ■ end of the 
3 0 mRNA. Both the 5' and 3 1 untranslated regions are used 
in preparing one of the expression cassettes of this 



[n 3c. The psJbA gene sequence is also available through 
T5 GenBank as further discussed in Example 4 . The 
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invention as further described below. 

The 5' UTR as previously discussed contains both 
the psjbA promoter and the RB47 binding site. The 
nucleotide sequence defining the psbA promoter contains 
the region of the psbA DNA involved in binding of RNA 
polymerase to initiate transcription. The -10 sequence 
component of the psbA promoter is indicated by the boxed 
nucleotide sequence upstream of the first SI while the - 
3 5 sequence is located approximately 3 5 bases before the 
putative initiation site. As shown in Figures 3A-3C, 
the -10 sequence is boxed, above which is the nucleotide 
position (-100) from the first translated codon. The - 
3 5 sequence is determined accordingly. A psJbA promoter 
for use in an expression cassette of this invention ends 
at the first indicated SI site (nucleotide position -92 
as counting from the first ATG) in Figures 3A-3C and 
extends to the 5* end (nucleotide position -251 as shown 
in Figures 3A-3C) . Thus, the promoter region is 160 
bases in length. A more preferred promoter region 
extends at least 100 nucleotides to the 5 1 end from the 
SI site. A most preferred region contains nucleotide 
sequence ending at the si site and extending 5 1 to 
include the -35 sequence, i.e., from -92 to -130 as 
counted from the first encoded amino acid residue (39 
bases) . 

The psbA RB47 binding site region begins at the 
first SI site as shown in Figures 3A-3C and extends to 
the first adenine base of the first encoded methionine 
residue. Thus, a psbA RB47 binding site in the psbA 
gene corresponds to the nucleotide positions from -91 to 
-1 as shown in Figure 3A-3C. 
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The above -identified regions are used to prepare 
expression constructs as described below. The promoter 
and RB47 binding site regions can be used separately; 
for example, the RB4 7 binding site sequence can be 
5 isolated and used in a eukaryotic or prokaryotic plasmid 
with a non-psbA promoter. Alternatively, the entire 
psbA 5 ! UTR having 251 nucleotides as shown in Figures 
3A-3C is used for the regulatory region in an expression 
cassette containing both the psbA promoter and RB47 
10 binding site sequence as described below. 

LJ 4 . Preparation of E xpression Vectors and Expression of 

u Coding Sequences 

^ A. Construc ts Containing an usbA Promoter, an 

i% RB47 Bin ding Site Nucleotide Sequence, a 



20 alternatively called plasmids, vectors, constructs and 
the like, are constructed containing various 
combinations of elements of the present invention as 
described in the following examples. Variations of the 



25 described in the present invention and in the examples 

below are contemplated for use in practicing the methods 
of this invention. Methods for manipulating DNA 
elements into operable expression cassettes are well 
known in the art of molecular biology. Accordingly, 

30 variations of control elements, such as constitutive or 
inducible promoters, with respect to prokaryotic or 



Desired Heterologous C oding Sequence, an RB47- 
Encoding Sequence and an RB60 -Encoding 



Sequence 



Plasmid expression vector constructs, 



positioning and operably linking of the genetic elements 
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eukaryotic expression systems as described in Section C. 
are contemplated herein although not enumerated. 
Moreover, the expression the various elements is not 
limited to one transcript producing one mRNA; the 
5 invention contemplates protein expression from more than 
one transcript if desired. 

As such, while the examples below recite one or two 
types of expression cassettes, the genetic elements of 
RB4 7 binding site, any desired coding sequence, in 
10 combination with RB47 and RB60 coding sequences along 
h % with a promoter are readily combined in a number of 
^ operably linked permeations depending on the 
M requirements of the cell system selected for the 
m expression. For example, for expression in a 
L l5 chloroplast, endogenous RB47 protein is present 
C3 therefore an expression cassette having an RB47 binding 
u site and a desired coding sequence is minimally required 
s iJ along with an operative promoter sequence. 
*3 Overexpression of RB47 may be preferable to enhance the 
20 translation of the coding sequence; in that case, the 
chloroplast is further transformed with an expression 
cassette containing an RB47-encoding sequence. Although 
the examples herein and below utilize primarily the 
sequence encoding the precursor form of RB47, any of the 
25 RB47-encoding sequences described in the present 
invention, i.e., RB4 7 precursor, mature RB4 7 and 
histidine-modif ied RB47 are contemplated for use in any 
expression cassette and system as described herein. To 
regulate the activation of translation, an RB60-encoding 
3 0 element is provided to the expression system to provide 
the ability to regulate redox potential in the cell as 
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taught in Section B. These examples herein and below 
represent a few of the possible permutations of genetic 
elements for expression in the methods of this 



invention. 



5 



In one embodiment, a plasmid is constructed 



containing an RB47 binding site directly upstream of an 
inserted coding region for a heterologous protein of 
interest, and the RB47 and RB60 coding regions. 
Heterologous refers to the nature of the coding region 
10 being dissimilar and not from the same gene as the 
' s % regulatory molecules in the plasmid, such as RB47 and 
L ^ RB60. Thus, all the genetic elements of the present 
M invention are produced in one transcript from the IPTG- 
\n inducible psbA promoter. Alternative promoters are 
t5 similarly acceptable. 

[3 The final construct described herein for use in a 

u prokaryotic expression system makes a single mRNA from 
which all three proteins are translated. The starting 
*2 plasmid is any E. coli based plasmid containing an 
20 origin of replication and selectable marker gene. For 
this example, the Bluescript plasmid, pBS, commercially 
available through Stratagene, Inc., La Jolla, CA, which 
contains a polylinker-cloning site and an ampicilin 
resistant marker is selected for the vector. 
25 The wild-type or native psJbA gene (Erickson et al . , 

Embo J . , 3:2753-2762 (1984), also shown in Figures 3A- 
3C, is cloned into pBS at the EcoRI and BamHI sites of 
the polylinker. The nucleotide sequence of the psbA 
gene is available on GenBank with the 5* UTR and 3' UTR 
30 respectively listed in Accession Numbers X01424 and 

X02350. The EcoRI site of psJbA is 1.5 kb upstream of 
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the psbA initiation codon and the BamHI site is 2 kb 



downstream of the stop codon. This plasmid is referred 
to as pDl . 

Using site-directed PCR mutagenesis, well known to 
5 one of ordinary skill in the art, an Ndel site is placed 
at the initiation codon of psJbA in the pDl plasmid so 
that the ATG of the Ndel restriction site is the ATG 
initiation codon. This plasmid is referred to as 
pDl/Nde. An Nde site is then placed at the initiation 
- 10 codon of the gene encoding the heterologous protein of 

interest and an Xho I site is placed directly downstream 
W (within 10 nucleotides) of the TAA stop codon of the 
u heterologous protein coding sequence. Again using site- 
^ directed mutagenesis, an Xhol site is placed within 10 
-Is nucleotides of the initiation codon of RB47, the 
n preparation of which is described in Example 2, and an 
L i NotI site is placed directly downstream of the stop 
W codon of RB47. The heterologous coding region and the 
u RB47 gene are then ligated into pDl/Nde so that the 
20 heterologous protein gene is directly adjacent to the 
RB4 7 binding site and the RB47 coding region is 
downstream of the heterologous coding region, using the 
Xho I site at the heterologous stop codon and the Not I 
site of the pDl polylinker. 
25 These genetic manipulations result in a plasmid 

containing the 5 ' end of the psbA gene including the 
promoter region and with the RB47 binding site 
immediately upstream of a heterologous coding region, 
and the RB47 coding region immediately downstream of the 
3 0 heterologous coding region. The nucleotides between the 
stop codon of the heterologous coding region and the 
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initiation codon of the RB47 coding region is preferably 
less than 20 nucleotides and preferably does not contain 
any additional stop codons in any reading frame. This 
plasmid is referred to as pDl/RB47. 
5 Using site-directed mutagenesis, a Not I site is 

placed immediately (within 10 nucleotides) upstream of 
the initiation codon of RB60, the preparation of which 
is described in Example 2, and an Xba I site is placed 
downstream of the RB60 stop codon. This DNA fragment is 
— 10 then ligated to the 3 ' end of the psbA gene using the 

ryss 

"5 Xba I site found in the 3 1 end of the psJbA gene so that 
^ the psJbA 3 1 end is downstream of the RB60 coding region. 
M This fragment is then ligated into the pDl/RB47 plasmid 
Lil using the Not I and BamHI sites so that the RB6 0 coding 
"is region directly follows the RB47 coding region. The 
"3 resulting plasmid is designated pDl/RB47/RB60 . 
u Preferably there is less then 20 nucleotides between the 
^ RB4 7 and RB60 coding regions and preferably there are no 

stop codons in any reading frame in that region. The 
20 final plasmid thus contains the following genetic 

elements operably linked in the 5 ! to 3 ' direction: the 
5 1 end of the psJbA gene with a promoter capable of 
directing transcription in chloroplasts , an RB47 binding 
site, a desired heterologous coding region, the RB4 7 
25 coding region, the RB60 coding region, and the 3 f end of 
the psbA gene which contains a transcription termination 
and mRNA processing site, and an E. coli origin of 
replication and amplicillin resistance gene. A diagram 
of this plasmid with the restriction sites is shown in 
3 0 Figure 4 . 

Expression of pDl/RB47/RB60 in E* coli to produce 
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recombinant RB4 7, RB60 and the recombinant heterologous 
protein is performed as described in Example 4B. The 
heterologous protein is then purified as further 
described. 

5 Expression cassettes in which the sequences 

encoding RB47 and RB60 are similarly operably linked to 
a heterologous coding sequence having the psbA RB4 7 
binding site as described in Example 3 are prepared with 
a different promoter for use in eukaryotic, such as 
10 mammalian expression systems. In this aspect, the 

cassette is similarly prepared as described above with 
L !r the exception that restriction cloning sites are 
M dependent upon the available multiple cloning sites in 
[rj the recipient vector. Thus, the RB47 binding site 
;f5 prepared in Example 3 is prepared for directed ligation 
< 3 into a selected expression vector downstream of the 
=i promoter in that vector. The RB4 7 and RB60 coding 
% sequences are obtained from the pDl/RB47/RB60 plasmid by 
^ digestion with Xhol and Xbal and inserted into a 

similarly digested vector if the sites are present. 
Alternatively, site-directed mutagenesis is utilized to 
create appropriate linkers. A desired heterologous 
coding sequence is similarly ligated into the vector for 
expression. 



b . construct Conta ining RB47 Nucleotide Sequence 

1) Purified Recombinant RB47 Protein 
In one approach to obtain purified 
recombinant RB47 protein, the full length RB47 cDNA 
prepared above was cloned into the E. coli expression 
vector pET3A (Studier et al . , Methods Enzvmol . , 185:60- 



AMENDEDaHEEi ryY 



# •Mft 9 3/00*4(1 

- 69 - 1WVUS : 7 AUG 1998 

89 (1990) ) , also commercially available by Novagen, 
Inc., Madison, WI and transformed into BL21 E. coli 
cells. The cells were grown to a density of 0/4 (OD 600 ) , 
then induced with 0.5 mM IPTG. Cells were then allowed 
5 to grow for an additional 4 hours, at which point they 
were pelleted and frozen. 

Confirmation of the identity of the cloned cDNA as 
encoding the authentic RB47 protein was accomplished by 
examining protein expressed from the cDNA by immunoblot 
10 analysis and by RNA binding activity assay. The 
* L i recombinant RB4 7 protein produced when the RB4 7 cDNA was 
^ expressed was recognized by antisera raised against the 
M C. reinhardtii RB47 protein. The E. coli expressed 
lH protein migrated at 80 kDa on SDS-PAGE, but the protein 
4:5 was actually 6 9 kDa, as determined by mass spectrometry 

of the E. coli expressed protein. This mass agrees with 
U the mass predicted from the cDNA sequence. A 60 kDa 
ht i product was also produced in E. coli, and recognized by 
*3 the antisera against the C. reinhardtii protein, which 

2 0 is most likely a degradation or early termination 

product of the RB47 cDNA. The recombinant RB4 7 protein 
expressed from the RB47 cDNA is recognized by the 
antisera raised against the C. reinhardtii protein at 
levels similar to the recognition of the authentic C. 
25 reinhardtii RB47 protein, demonstrating that the cloned 
cDNA produces a protein product that is immunologically 
related to the naturally produced RB47 protein. In 
order to generate a recombinant equivalent of the 
endogenous native RB4 7, the location of the 4 7 kDa 

3 0 polypeptide was mapped on the full-length recombinant 

protein by comparing mass spectrometric data of tryptic 
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digests of the C. reinhardtii 47 kDa protein and the 
full-length recombinant protein. Thus, peptide mapping 
by mass spectrometry has shown that the endogenous RB4 7 
protein corresponds primarily to the RNA binding domains 
. 5 contained within the N- terminal region of the predicted 
precursor protein, suggesting that a cleavage event is 
necessary to produce the mature 47 kDa protein. Thus, 
full-length recombinant RB47 is 69 kDa and contains a 
carboxy domain that is cleaved in vivo to generate the 
- 10 endogenous mature form of RB47 that is 47 kDa. 
^ To determine if the heterologously expressed RB47 

W protein was capable of binding the psbA RNA, the E . coli 
M expressed protein was purified by heparin agarose 
m chromatography. The recombinant RB47 protein expressed 
^"15 in E . coli was purified using a protocol similar to that 
□ used previously for purification of RB47 from C. 
il reinhardtii . Approximately 5 g of E. coli cells grown 

as described above were resuspended in low salt 
>3 extraction buffer (10 mM Tris [pH 7.5], 10 mM NaCl, 10 
20 mM MgCl 2/ 5 mM S-mercaptoethanol) and disrupted by 

sonication. The soluble cell extract was applied to a 5 
mL Econo-Pac heparin cartridge (Bio-Rad) which was 
washed prior to elution of the RB47 protein (Danon and 
Mayfield, Embo J . , 10:3993-4001 (1991)). 
25 The E. coli expressed protein that bound to the 

heparin agarose matrix was eluted from the column at the 
same salt concentration as used to elute the authentic 
C. reinhardtii RB47 protein. This protein fraction was 
used in in vitro binding assays with the psbA 5 1 UTR. 
30 Both the 69 and 60 kDa E. coli expressed proteins 

crosslinked to the radiolabeled psbA 5' UTR at levels 
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similar to crosslinking of the endogenous RB47 protein, 
when the RNA/protein complex is subjected to UV 
irradiation . 

Heparin agarose purified proteins, both from the E. 
5 coli expressed RB47 cDNA and from C. reinhardtii cells, 
were used in an RNA gel mobility shift assay to 
determine the relative affinity and specificity of these 
proteins for the 5' UTR of the psJbA mRNA. The E. coli 
expressed proteins bound to the psJbA 5» UTR in vitro 

10 with properties that are similar to those of the 

% endogenous RB47 protein purified from C. reinhardtii . 
*Z RNA binding to both the E. coli expressed and the 
M endogenous RB4 7 protein was competed using either 200 
m fold excess of unlabeled psJbA RNA or 200 fold excess of 
T5 poly (A) RNA. RNA binding to either of these proteins 
!3 was poorly competed using 200 fold excess of total RNA 
U or 200 fold excess of the 5 1 UTR of the psJbD or psbC 

1 1 RNAs . Different forms of the RB47 protein (47 kDa 
*3 endogenous protein vs. the 69 kDa E. coli expressed 

2 0 protein) may account for the slight differences in 

mobility observed when comparing the binding profiles of 
purified C. reinhardtii protein to heterologously 
expressed RB47. 

The mature form of RB47 is also produced in 

2 5 recombinant form by the insertion by PCR of an 

artificial stop codon in the RB47 cDNA at nucleotide 
positions 1403-1405 with a stop codon resulting in a 
mature RB4 7 recombinant protein having 4 02 amino acids 
as shown in Figures 1A-1D. An example of this is shown 

3 0 in Figures 5A-5B for the production of a recombinant 

histidine-modif ied RB47 mature protein as described 
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below. The complete RB47 cDNA is inserted into an 

expression vector, such as pET3A as described above, for 

expression of the mature 4 7 kDa form of the RB47 

protein. In the absence of the inserted stop codon, the 
5 transcript reads through to nucleotide position 2066- 

2 068 at the TAA stop codon to produce the precursor RB4 7 

having the above -described molecular weight 

characteristics and 623 amino acid residues. 

Recombinant RB47 is also expressed and purified in 
10 plant cells. For this aspect, C. reinhardtii strains 
3 were grown in complete media (Tris-acetate-phosphate 
'% [TAP] (Harris, The Chlamydonas Sourcebook, San Diego, 
fl CA, Academic Press (1989)) to a density of 5x10 s cells/mL 
in under constant light. Cells were harvested by 
lis centrifugation at 4°C for 5 minutes at 4,000 g. Cells 
R were either used immediately or frozen in liquid N 3 for 
M storage at -70°C. 

*3 Recombinant RB47 protein was also produced as a 

' 3 modified RB47 protein with a histidine tag at the amino- 
20 terminus according to well known expression methods 
using pET19-D vectors available from Novagen, Inc., 
Madison, WI . The nucleotide and amino acid sequence of 
a recombinant histidine-modif ied RB47 of the mature 47 
kDa form is shown in Figures 5A-5B with the nucleotide 
25 and amino acid sequence also listed in SEQ ID NO 14. 
Thus the nucleotide sequence of a histidine-modif ied 
RB47 is 1269 bases in length. The precursor form of the 
RB47 protein is similarly obtained in the expression 
system, both of which are modified by the presence of a 
3 0 histidine tag that allows for purification by metal 
affinity chromatography. 
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The recombinant histidine-modif ied RB47 purified 
through addition of a poly-histidine tag followed by Ni* 2 
column chromatography showed similar binding 
characteristics as that described for recombinant 
*5 precursor RB47 described above. 

c. nnnfltrucr .fl Containing RB60 Nucleotide Sequence 
In one approach to obtain purified recombinant 

RB60 protein, the full-length RB60 cDNA prepared above 
10 was cloned into the E. coli expression vector pET3A 
^ (Studier et al . , Methods Enzvmol . . 185:60-89 (1990)), 
s - also commercially available by Novagen, Inc., Madison, 
s t WI and transformed into BL21 E. coll cells. The cells 
■H were grown to a density of 0.4 (OD fi00 ) , then induced with 
15 0.5 mM IPTG. Cells were then allowed to grow for an 
R additional 4 hours, at which point they were pelleted 
s * and frozen. 

Recombinant histidine-modif ied RB60 was also 
^ expressed with a pET19-D vector as described above for 
20 RB47 that was similarly modified. Purification of the 

recombinant RB60 proteins was performed as described for 

RB4 7 thereby producing recombinant RB60 proteins for use 

in the present invention. 

The RB6 0 coding sequence is also mutagenized for 
25 directional ligation into an selected vector for 

expression in alternative systems, such as mammalian 

expression systems . 

D. Construc ts Containing an RB4 7 -Encoding 
3 0 sequence and an RB6 0 -Encoding Sequence 

To prepare an expression cassette for encoding 
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both RB47 and RB60, one approach is to digest plasmid 
pDl/RB47/RB60 prepared above with Xhol and Xbal to 
isolate the fragment for both encoding sequences. The 
fragment is then inserted into a similarly digested 
'5 expression vector if available or is further mutagenized 
to prepare appropriate restriction sites. 

Alternatively, the nucleotide sequences of RB47 and 
RB60, as described in Example 2, are separately prepared 
for directional ligation into a selected vector. 
10 An additional embodiment of the present invention 

3 is to prepare an expression cassette containing the RB4 7 

binding site along with the coding sequences for RB47 
\t and RB60, the plasmid pDl/RB47/RB60 prepared above is 
in digested with Ndel and Xhol to prepare an expression 
JL5 cassette in which any desired coding sequence having 
H similarly restriction sites is directionally ligated. 
M Expression vectors containing both the RB4 7 and RB6 0 
*3 encoding sequences in which the RB47 binding site 
^ sequence is utilized with a different promoter are also 
2 0 prepared as described in Example 4A. 

E. rnnatruc f fl Containing an RB47 Binding Site 
Nnnleotidfi Sequen ce. Insertion Sites for a 

nesired H e terologous Coding Sequent , and an 
25 PR47-Encodina Secruence 

In another permutation, a plasmid or 
expression cassette is constructed containing an RB47 
binding site directly upstream of an inserted coding 
region for a heterologous protein of interest, and the 
3 0 RB47 coding region. The final construct described 

herein for use in a prokaryotic expression system makes 
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a single mRNA from which both proteins are translated. 

The plasmid referred to as pDl/RB47 is prepared as 
described above in Example 4A. A diagram of this 
plasmid with the restriction sites is shown in Figure 6. 
5 Expression of pDl/RB47 in E. coli to produce 

recombinant RB4 7 and the recombinant heterologous 
protein is performed as described in above. The 
heterologous protein is then purified as further 
described. 

10 To produce an expression cassette that allows for 

*3 insertion of an alternative desired coding sequence, the 
x % plasmid pDl/RB47 is digested with Ndel and Xhol 
*\i resulting in a vector having restriction endonuclease 
Ln sites for insertion of a desired coding sequence 
lis operably linked to a RB47 binding site and RB47 coding 
H sequence on one transcriptional unxt. 

J J F . rnnfltruc t fl Containing an RB47 Binding Site 

^ Wucleoti Hft fifiguftnflft 1 . Insertion Sites for a 

20 Dfisired Heterolo gous Coding Sequence, and an 

RR4 7-EncoHina Sequence 
In another permutation, a plasmid or 
expression cassette is constructed containing an RB47 
binding site directly upstream of an inserted coding 
2 5 region for a heterologous protein of interest, and the 
RB6 0 coding region. The final construct described 
herein for use in a prokaryotic expression system makes 
a single mRNA from which both proteins are translated. 
In this embodiment, a separate construct encoding 
30 recombinant RB47 as described in Example 4B is co- 
transformed into the E. coli host cell for expression. 
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The plasmid referred to as pDl/RB60 is prepared as 
described above for pDl/RB47 in Example 4A with the 
exception that Xhol and Xbal sites are created on RB60 
rather than RB4 7. 

Expression of pDl/RB60 in E. coli to produce 
recombinant RB60 and the recombinant heterologous 
protein is performed as described in above with the 
combined expression of RB47 from a separate expression 
cassette- The heterologous protein is then purified as 
further described. 

To produce an expression cassette that allows for 
insertion of an alternative desired coding sequence, the 
plasmid pDl/RB60 is digested with Ndel and Xhol 
resulting in a vector having restriction endonuclease 
sites for insertion of a desired coding sequence 
operably linked to a RB47 binding site and RB60 coding 
sequence on one transcriptional unit. 

G . Construct Containing RB47 Binding Site 

T^lftnt-.Tde Secn ifince and Heterologous Coding 
Sequences 

1) Ryprflflaion of Recombinant Tftf.qmas TQX\n 



that are variations of those described above. The 
constructs described below contain an RB47 binding site 



activating protein RB47 was endogenously provided in the 
chloroplast and or plant cell. In other aspects however 
as taught by the methods of the present invention, the 
chloroplast is further transformed with an RB47- 



fiinale Chain Antibody 

The examples herein describe constructs 



sequence and a heterologous coding sequence. The 
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expression construct as described above for 
overexpression of RB47 to enhance translation 
capacities . 

A strain of the green algae Chlamydomonas 
reinhardtii was designed to allow expression of a single 
chain antibody gene in the chloroplast . The 
transgenically expressed antibody was produced from a 
chimeric gene containing the promoter and 5 ' 
untranslated region (UTR) of the chloroplast psbA gene 
prepared as described above, followed by the coding 
region of a single chain antibody (encoding a tetanus 
toxin binding antibody), and then the 3 1 UTR of the psbA 
gene also prepared as described above to provide for 
transcription termination and RNA processing signals. 
This construct is essentially pDl/Nde including a 
heterologous coding sequence having a 3 1 Xbal 
restriction site for ligation with the 3' psbA gene and 
is diagramed in Figure 7 . 

The psbA- single chain construct was first 
transformed into C. reinhardtii chloroplast and 
transf ormants were then screened for single chain gene 
integration. Transformation of chloroplast was 
performed via bolistic delivery as described in US 
Patents 5,545,818 and 5,553,878, the disclosures of 
which are hereby incorporated by reference . 
Transformation is accomplished by homologous 
recombination via the 5* and 3' UTR of the psbA mRNA. 

As shown in Figure 8, two of the transf ormants that 
contained the single chain chimeric gene produced single 
chain antibodies at approximately 1% of total protein 
levels. The transgenic antibodies were of the correct 
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size and were completely soluble, as would be expected 
of a correctly folded protein. Few degradation 
products were detectable by this Western analysis, 
suggesting that the proteins were fairly stable within 
5 the chloroplast. To identify if the produced antibody 
retained the binding capacity for tetanus toxin, EL ISA 
assays were performed using a mouse -produced Fab, from 
the original tetanus toxin antibody, as the control . 
The chloroplast single chain antibody bound tetanus 
10 toxin at levels similar to Fab, indicating that the 
"5 single chain antibody produced in C. reinhardtii is a 
H fully functional antibody. These results clearly 
M demonstrate the ability of the chloroplast to synthesis 
[n and accumulate function antibody molecules resulting 
^5 from the translational activation of an RB47 binding 
H site in an expression cassette by endogenous RB47 
m protein in the chloroplast . 

Is 'ZZ 

_ U 2) Repressio n of Bacterial Luciferase Enzvme 

— 20 Having Two Subunits 

For the production of molecules that 
contain more than one subunit, such as dlgA and 
bacterial luciferase enzyme, several proteins must be 
produced in stoichiometric quantities within the 
25 chloroplast. Chloroplast have an advantage for this 

type of production over cytoplasmic protein synthesis in 
that translation of multiple proteins can originate from 
a single mRNA. For example, a dicistronic mRNA having 
5 1 and 3' Ndel and Xbal restriction sites and containing 
30 both the A and B chains of the bacterial luciferase 

enzyme was inserted downstream of the psbA promoter and 
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5 1 UTR of the pDl/Nde construct prepared in Example 4A 
above. In this construct, the bacterial LuxAB coding 
region was ligated between the psbA 5' UTR and the psbA 
3 • end in an E. coli plasmid that was then transformed 
5 into Chlamydomonas reinhardtii cells as described above 
for expression in the chloroplast. A schematic of the 
construct is shown in Figure 9. Single transformant 
colonies were then isolated. A plate containing a 
single isolate was grown for 10 days on complete media 
10 and a drop of the luciferase substrate n-Decyl Aldehyde 
k i was placed on the plate and the luciferase visualized by 
^ video-photography in a dark chamber. Both proteins were 
M synthesized from this single mRNA and luciferase 
m activity accumulated within the chloroplast as shown in 
its Figure 10. Some mRNA within plastids contained as many 
H as 5 separate proteins encoded on a single mRNA. 



2 0 shown in Figure 11 is engineered so that the psbA 

promoter and 5' UTR are used to drive the synthesis of 
the light chain and heavy chains of an antibody, and the 
J chain normally associated with IgA molecules. The 
nucleic acid sequences for the dimeric IgA are inserted 

25 into the RB47 binding site construct prepared in Example 
4A. The construct is then transformed into C. 
reinhardtii cells as previously described for expression 
of the recombinant dlgA. 



30 plastid allows for the self assembly of a dimeric IgA 



Expressi on of Dimeric IgA 

To generate dimeric IgA, the construct 



Production of these three proteins within the 



(dlgA) . Production of this complex is monitored in 
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